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Abstract — We consider a MIMO fading broadcast channel 
where channel state information is acquired at user terminals 
via downlink training and channel feedback is used to provide 
transmitter channel state information (CSIT) to the base station. 
The feedback channel (the corresponding uplink) is modeled as 
an AWGN channel, orthogonal across users. The total bandwidth 
consumed is the sum of the bandwidth/resources used for down- 
link training, channel feedback, and data transmission. Assuming 
that the channel follows a block fading model and that zero- 
forcing beamforming is used, we optimize the net achievable 
rate for unquantized (analog) and quantized (digital) channel 
feedback. The optimal number of downlink training pilots is 
seen to be essentially the same for both feedback techniques, but 
digital feedback is shown to provide a larger net rate than analog 
feedback. 

I. Introduction 

We consider a MIMO Gaussian broadcast channel modeling 
the downlink of a system where a Base Station (BS) has Nt 
antennas and serves K single-antenna User Terminals (UTs). 
A channel use of such system is described by 



Vk = hfe X 



Zfc, = 1, . 



(1) 



where is the channel output at UT k, Zk ~ C3\f(0, Nq) is 
the corresponding Additive White Gaussian Noise (AWGN), 
hfe G Cf is the vector of channel coefficients from the BS 
antenna array to the fc-th UT antenna and x is the vector of 
channel input symbols transmitted by the BS, subject to the 
average power constraint E[|xp] < P. We use p to denote 
the nominal SNR: p = ^. We assume a block fading model, 
i.e., the channel remains constant over a coherence interval 
of T channel uses. Albeit suboptimal, we focus on zero- 
forcing (ZF) beamforming with K — Nt users for its analytical 
tractability. 

In order to perform zero-forcing beamforming (or any other 
multi-user MIMO strategy), the BS must have an accurate esti- 
mate of the channel to each UT. Such information is generally 
acquired in a two-step process: each UT first estimates its own 
downlink channel during a common downlink training phase, 
after which each UT transmits its channel estimate over a 
feedback channel (on the corresponding uplink) to the BS. 
The rates achievable with zero-forcing beamforming depend 
critically on the quality of the CSI available to the BS, but 
high quality CSI can only be achieved if (a) each UT is able 
to accurately estimate its own channel (i.e., by using a suitably 
long downlink training phase), and (b) the process of channel 



feedback does not add too much additional distortion to the 
channel information. 

In this work, we attempt to determine the optimum fraction 
of resources that should be dedicated to training and feedback 
with the criterion being the net spectral efficiency achievable 
on the downlink. Training consumes downlink bandwidth 
while channel feedback consumes uplink bandwidth. From 
a high-level system perspective, increasing either training or 
feedback effectively takes away from the bandwidth available 
for actual data transmission. The net spectral efficiency is 
therefore the transmission rate reflecting the overhead due to 
the downlink bandwidth consumed for training and the uphnk 
bandwidth used for channel feedback. 

We utilize our earlier work in which the downlink spectral 
efficiency was tightly bounded as a function of the amount 
of training and feedback [1]. This allows the net spectral 
efficiency per UT, accounting for training/feedback resources, 
to be accurately lower bounded as 



> 1 - 



Ti + Tfb \ / 2F -t-f;- 



T 



R^^ - AR) 



(2) 



where Ti and Tfb are the number of channel symbols used 
for training and feedback, respectively, per coherence block 
of length tQ The quantity R^^ denotes the rate achievable 
with ideal CSI while AR is the rate gap due to imperfect 
CSI. This rate gap depends on the particular feedback strategy 
used, as specified in [1], but is decreasing in both Ti and Tfb. 
The objective of this paper is to maximize the net spectral 
efficiency with respect to Ti and Tfb for the cases of analog 
and digital feedback, and to understand how the optimal 
values of Ti and Tfb as well as the optimized net spectral 
efficiency depend on various system parameters of interest 
(e.g., blocklength T, signal-to-noise ratio, and Nt). 

The present work is an extension of [2], where the same 
optimization was investigated for the case of analog (unquan- 
tized) feedback over a shared MIMO MAC feedback channel. 
On the other hand, here we consider both analog and digital 
feedback techniques and focus primarily on an AWGN model 
for the feedback channel. Interested readers can refer to [2] 
for a discussion of other prior work on this general topic. 

' Using Ti and Tfj, symbols per length T block is fully equivalent to using 



a fraction ^ and 



of the total bandwidth for training and feedback. 



II. Channel State Estimation and Feedback 

In this section we describe the basic training and channel 
feedback scheme that allows CSI to be acquired. 

1) Common downlink training: Ti shared pilot symbols 
(essentially ^ pilots per BS antenna) are transmitted to allow 
all UTs to estimate their downlink channel vectors {hfc} based 
on the observation 



Sfc 



TiP 
1^ 



(3) 



where Zfc - 6^(0, iVoI). Each UT performs linear MMSE of 
hfc from the observation Sk, which results in a per-coefficient 
estimation error with variance [1, Equation 7] 



1 



1 



(4) 



P 



2) Channel feedback: Each UT feeds back its channel 
estimation immediately after the training phase. We focus on 
the scenario where the feedback channel is modeled as an 
AWGN channel with the same signal-to-noise ratio p, identical 
to the nominal downlink SNR. Because UT's are assumed 
to access the feedback channel orthogonally, a total of Tfb 
channel symbols translates into ^ feedback channel uses per 
UT. The different feedback strategies are described in Section 

From the feedback received from each of the UT', the BS 
obtains the channel estimate hi .... , h^^. . The imperfection 
in the CSI available to the BS stems from two sources: the 
channel estimation error during the common training phase, 
and the distortion incurred during the feedback phase. For 
analog feedback the distortion is due to additive noise in the 
feedback channel, while for digital feedback it consists of the 
quantization error as well as possible errors while transmitting 
bits over the feedback channel. 

If the beamforming vectors Vi,...,vn_^ are selected by 
using zero-forcing^ on the basis of the imperfect channel 
estimates hi, . . . , hTVt, the following per-UT rate is achievable 
if equal-power (across UT's) Gaussian inputs are used: 



E 



log 1 



2_P_ 



(5) 



assuming each UT is aware of its received SINR0 Imperfect 
CSI results in non-zero interference coefficients jhl'^Vjl, which 
in turn decrease the rate. In [1] it is shown that the rate in (|5]l 
is accurately lower-bounded by 

R^^ - AR (6) 

where R^^ is the rate achievable with perfect CSI and AR 
denotes the rate gap given by 



Nt 



(7) 



^Such knowledge can be acquired through an additional dedicated training 
round as discussed in [1]. This training round does not significantly affect the 
present work, and thus is ignored for the sake of simplicity. 



The rate gap depends on Ti, Tfb and the feedback strategy. 
Its closed-form expressions are found in [1] for the cases 
addressed in this paper 

III. Optimizing training and feedback 

We now consider the problem of interest, which is the 
maximization of the net spectral efficiency: 

Ti + Tfb^ 



max 

Ti,Tfb:Ti+Tfb<T 



1 



'-'^ (i?ZF-Ai?(Ti,Tfb)) 



(8) 

To facilitate solving this optimization, it is useful to write our 
problem as follows: 

Ti+Tfb^ 



max max | 1 

Tt<T Ti+Ttb=Tt 



T 



Ai?(Ti,Tfb)) 



Furthermore, we write the rate gap as follows: 

Ai?(Ti, Tfb) = log (1 + ff(Ti, Tfb)) 



(9) 



(10) 



where the function g{-) depends on the feedback strategy and 
is defined later Because the first multiplicative term is constant 
when Ti + Tfb = Tt , the inner maximization corresponds 
to minimization of the function g{-) subject to the constraint 

Ti + Tfi, < Tt: 



9{Tt. 



min g(Ti,Tfb), 



(11) 



while the second step is a maximization of the net spectral 
efficiency over Tt (the total training and feedback symbols): 

Tt 



max I 1 „ 

Tf. Tt<T V T 



R^^ -\og{l+g{Tt)). (12) 



In the following this two-step strategy is implemented for 
analog feedback, TDD systems with channel reciprocity, and 
digital feedback (with and without feedback channel errors). 

A. Analog Feedback 

We begin by considering unquantized analog feedback, 
whereby the complex amplitude of each discrete-time feedback 
symbol is chosen as the UT's estimate of each complex 
channel coefficient. Because each UT is allowed ^ feedback 
channel uses, this corresponds to feedback channel uses 
per channel coefficient (if Tfb > N^, each coefficient is 
effectively repeated ^ times on the feedback channel). This 

results in distortion that is inversely proportional to and 
the resulting rate gap is described as [1, Section IV]: 

±1 -tfb 

We begin by minimizing g{-) subject to a constraint on Ti + 
Tfb. For the sake of generality, we rewrite g{) as: 

^analo. 



g »(Ti,Tfb) = — + — . 



(14) 



where wi — Nt I and Wfb — Nt{Nt — 1). Therefore, the 
minimization to be solved is: 



Ti 



W2 



subject to Ti+Tfb<Tt. 



(15) 
(16) 



This is readily seen to be a convex optimization, and can be 
solved by forming the Lagrangian: 

i:(Ti, Tfb, fi) = g{Ti,Tn,) + ^(Ti + Tft) 

where /i > is the Lagrangian multiplier. The KKT condition 
yields the following solution 



(17) 



In terms of Tt, these can be written as 



(18) 



where we let 3C = 
is given by 



while the objective value 



X 



(19) 



It is clear that Tt is shared between training and feedback 
proportional to the square root of the weights wi and Wfh- 

Using (fT9] l. the overall optimization can now be character- 
ized in terms of a single variable Tt- Namely the second step 
of the proposed optimization corresponds to maximizing 



f(Tt) = 1 - 



T 



i?^^F-log 1 



Tt 



(20) 



Because / is concave in Tt, the optimal Tj* can be found by 
numerically solving for = where 



dl 
dTt 



3C(l-f) 



Ti 



1 

T 



- log ( 1 



% 
Tt 



(21) 



Although a closed-form solution for Tj* does not exist, it is 
possible to compute how this quantity scales with blocklength 
T. From ( 1211 1. the optimal Tj* satisfies the following equality 



3C(r - Tt) 



Tni + 



- 



log 



1+^ 

Tt 



(22) 



It is easy to see that the derivative in ( |2TI) is upperbounded by 
^f{Tt), where 



f{Tt) = 



X{T~ Tt) 
Ti 



X 
Tt 



(23) 



Since / is concave, it follows that the solution Tt of the 
equation f{Tt) = is an upper bound to the optimal value 
T*. Solving f{Tt) = we find 



n <Tt^ 



XT 



(24) 



Furthermore, when the rate gap is small such that 
log (^1 + ~ ^ (which becomes accurate for large T), 
the upperbound also becomes a very good approximation. 

The upperbound (l24l i yields two interesting behaviors: 1) 
for a fixed SNR (i.e., constant i?^^) T* increases as 0{VT) 
as T — > oo; 2) for a fixed coherence interval T, Tj* decreases 
as for large SNR, or equivalently, it decreases as 



Cl(l/^log(SNR)) since R^^ = log(SNR) + 0(1) for large 
SNR. 

In addition, an upper bound on T^ can be reached by 
combining (l24]i with (fTsl i: 



Tt 



wiT 



{Nt - 1)T 



(25) 



X^' V V i?ZF • 

According to this approximation, the optimal downlink train- 
ing is independent of Wfb, and thus of the efficiency of the 
feedback channel. 

Next, we examine the impact of Tt on the achievable rate. 
Using the upperbound (l24b into (|20|) . the objective value can 
be approximated as 



f{Tt) = 



1 - 



After some manipulation, it can be shown that the resulting 
effective rate gap with respect to R^^ is given by 



i?^^ - f{Tt) < i?^^ - f{Tt) 



T 



(26) 



Thus, the gap to a perfect CSI system decreases roughly as 
0{1/Vt) as T increases. 

B. Time-Division Duplexing 

The analysis from the previous subsection can also be used 
to optimize the amount of uplink training performed in a time- 
division-duplexed (TDD) system with perfect channel reci- 
procity (i.e., the downlink and uplink channels are identical)0 
Note that no feedback is necessary in such case. In [1, Remark 
4.2] the rate gap for a TDD system that uses Ttdd uplink 
training symbols ( '^™° per MS) is given: 



AR = log 

The optimization over Ttdd is 
Ttdd \ 



Nt-1 
Tt 



DD 



max I 1 



T 



i?^^ - log ( 1 



Nt 



Ttdd 



(27) 



(28) 



which is clearly equivalent to the optimization in ( l20l l with 
Tt = Ttdd and 3C = A'i — 1. As a result, the analysis and 
approximations from the previous subsection carry over. By 
adapting (l24l l we have 



^TDD — Ttdd 



{Nt - 1)T 



(29) 



which is the same as the approximation to T* (the number of 
downlink training symbols) for analog feedback in ( l25b . 

Based upon the expression for the approximate rate gap in 
(|26] |. by comparing the value of X for analog feedback and 
for TDD we see that the rate gap for analog feedback is a 
factor 1 + yWt larger than for TDD. 

For future reference it is also worthwhile to notice that the 
TDD setting corresponds to the non-TDD setting with perfect 

'Note that a similar optimization is considered in [3], althougli in tliat work 
analysis of this optimization is not performed. 



feedback (i.e., the BS knows the UT channel estimates, or 
equivalently wpB = in Section IIII-AI ). As a resuh, the net 
rate achievable with TDD serves as an upper bound to that 
achievable with training and channel feedback. 

C. Error-Free Digital Feedback 

We now analyze digital feedback techniques, whereby each 
UT quantizes its vector channel estimate to B bits and then 
maps these bits into ^ transmit symbols. For the quantization 
step we consider a family of random vector quantization 
(RVQ) schemes. Assuming the feedback bits are received 
error- free, in [1, Section V] it is shown that the rate gap is 



Ai? = log ( 1 + -j; — +p2 — 



(30) 



where the distortion error is expressed in terms of bits. In this 
section we assume unrealistically that error-free communica- 
tion is possible over the feedback channel at its underlying 
capacity of logj ^1 -|- bits per channel use. Each of 
the Nf UT's utilize ^ channel uses, and therefore B — 
^ Iog2 (1 + p). As a result, we obtain 



9 



digital 



(Ti,Tfb) 



(31) 



The first step is the minimization of the above function subject 
to the constraint Ti + Tfb < Tt. Since g'^'S't'*' is convex in 
Ti, Tfb, we form the Lagrangian and readily obtain 



mVAW (32) 
21n(M)+ln(^lH^) 



(33) 



ln(l + p) 

Note that the feedback length grows as O (In/i), much slower 
than the linear increase (in [i) for the common training. 

Contrary to the earlier analog feedback case, g'^'S'*'*' cannot 
be expressed as a closed form of Tt but instead must be 
expressed as a function of /i. However, for the sake of com- 
parison with analog feedback we perform this optimization in 
terms of Ti rather than /i. Based upon ( [32] i we can express 
Tfb as a function of T\. 



and thus the net spectral efficiency can be written as: 



(34) 



V 



ri + 7Vt(iVt-i) 



21n(Ti)+ln 



ln(l+p) 



i?^^ - log 1 



T 



1 



Nt{Nt - 1)2 



(35) 



Ti (ri)2 1n(l+p). 

Because Tfb increases logarithmically with Ti, its effect on the 
maximization is rather minimal. As a result, the maximization 
of Ti is very similar to the maximization of Ttdd in the TDD 
setting, which is in turn similar to the maximization of Ti in 
the presence of analog feedback. 



D. Digital Feedback with Errors 

Rather than assuming (unrealistically) that the feedback 
channel operates at channel capacity and error-free, in this 
section we analyze a system where uncoded QAM is used 
to transmit each UT's quantized channel vector over the 



feedback channel. Each UT utilizes 



feedback channel 



uses. Assuming that quantization bits are arbitrarily mapped 
to channel symbols, one or more symbol errors (among the 
^ channel uses) makes the feedback from a particular UT 
effectively useless and thus leads to a rate effectively of zero. 
Under this assumption, the achievable net rate is given by: 



T 



(1-Pe,fb)[i?f -Ai?] 



(36) 



where AR is defined in ( l30l l. Because each UT utilizes ^ 
complex channel symbols, the number of feedback bits per 
user B — log2 M where AI is the number of constellation 
points. The per-symbol QAM error probability is given by 



P. 



1 



1-21 



1 



^ I M- 1 



(37) 



while the probability of a feedback error, Pe.ih, is the proba- 
bility that any of the symbols are received incorrectly: 



P. 



e,tb 



1-(1-P,) — 



(38) 



In order to allow for a two-step optimization, we rewrite 
the objective in (|36] | as: 

Tt' 



1 - ^ ) [rF - hiTt)] 



(39) 



where the effective rate-loss h{Tt) incorporates the loss due 
to feedback error is defined as: 



h{Tt) = min w(ri,Tfb) 

Ti,Tfb:Ti+Tfb<Tt 



(40) 



with 



u;(ri,Tfb) = (l-Pe,fb) log! 1 + ^— +pAf-rf^' 



+Pe,fbi?r- 



(41) 



If a reasonable constellation size is used, the probability 
of feedback error is quite small even when the number of 
feedback bits per users is relatively large (e.g., for Nt — A 
at 10 dB with B = 25 and 4-QAM, Pe,fb = 0.038). As 
a result, the minimization of ti;(ri,Tfb) is very similar to 
the minimization of g'^'S'*''\Ti,Tih) for error-free feedback in 
( |3TT l, but with a constellation of size M rather than 1+p. When 
computed numerically, an optimization over the constellation 
size is also performed. 

IV. Numerical Examples & Discussion 

This section provides some numerical examples to illustrate 
the analysis of the previous section. The system parameters, 
unless otherwise specified, aie Nt = 4 and p = 10 dB. 
In all cases the previously stated optimizations have been 
numerically computed with Ti and Tfb restricted to integers. 
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Fig. 1. Optimum number of feedback symbols (Tfi,) versus total training + 
feedback (Tt). 



Fig. 2. Optimum number of feedback symbols (Tfj,) and training symbols 
(Ti) versus blocklength (T). 



subject to the constraints T\ > Nt (to ensure at least one 
training symbol per channel coefficient) and Tfb > for 
analog feedback (one feedback symbol per channel coefficient) 
and Tfb > Nt for digital feedback (one FB symbol per UT). 

In Fig. [U the optimum number of feedback symbols (Tfb) 
is plotted versus Tt, the total training & feedback budget, for 
analog feedback, digital feedback, and digital feedback with 
errors (uncoded QAM). For analog feedback the number of 
feedback symbols grows linearly with Tt with slope Tfb = 
Y^^£=, while for digital feedback Tfb increases with Ti at 
a much slower rate (approximately logarithmically). The rate 
of increase for Tfb is particularly slow beyond Tt = 100. At 
this point digital feedback without errors corresponds to 10 
symbols per user and thus nearly 35 bits (B). At this point the 

B 

distortion due to quantization (2 ™t-i) is less than 10 and 
the gains in increasing B beyond this point are very negligible. 
Even when feedback using uncoded 4-QAM is considered, 
each user is quantizing to 28 bits at Tt = 100. The abrupt shift 
for digital feedback with errors occurs when the constellation 
changes from 4-QAM to BPSK: when the number of feedback 
symbols becomes too large (when Tt is sufficiently large) 
the probability of feedback error becomes significant and it 
becomes more efficient to reduce this error probability by 
reducing the constellation to BPSK while keeping the number 
of bits (B) nearly the same. This is a consequence of using 
uncoded transmission on the FB channel. 

In Fig.|2]the optimal values of Ti and Tfb are plotted versus 
blocklength T for analog, digital, and digital w/ errors; Ttdd 
is also plotted for TDD. Most striking is the fact that the 
optimal values of Ti and the optimal Ttdd are essentially 
identical for the three feedback techniques as well as for TDD. 
Furthermore, although not shown here, t he optimi zing values 

of Ti are very well approximated by \f^^^^^ as in dZSl) . 
On the other hand, the number of feedback symbols depends 
critically on the feedback method. Because analog feedback 
is so inefficient, a large number of feedback symbols are used 
so that the rate gap due to feedback is not too large. On the 
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Fig. 3. Achievable sum rate vs. blocklength (T). 



Other hand, digital feedback is very efficient and a relatively 
small number of feedback symbols is required. 

In Fig.[3]the net achievable rate is plotted versus blocklength 
T. For analog and TDD the rate approximations based upon 
(i26] l are indicated with dotted lines and are seen to become 
increasingly accurate as T is increased. Analog feedback is 
outperformed by digital feedback, with or without errors, for 
all blocklengths. This is because digital feedback offers a 
significantly smaller distortion as compared to analog when- 
ever Tfb is larger than (approximately) (i.e., one symbol 
per channel coefficient) [1, Section VI], and for reasonable 
blocklengths it is optimal to use Tfb larger than (Fig. |2]i. 
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